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Abstract It is well known that individuals can route messages on short 
paths through social networks, given only simple information about the 
target and using only local knowledge about the topology. Sociologists 
conjecture that the routes are found greedily by passing the message 
to an acquaintance that has more in common with the target than the 
current holder of the message, e. g. if a dentist in Saarbriicken wants 
to send a message to a specific lawyer in Munich, he may forward it to 
someone who is a lawyer and/or lives in Munich. Modelling this setting, 
Eppstein et al. introduced the notion of category-based routing. The goal 
is to assign a set of categories to each node of a graph such that the 
categories are connected, greedy routing is possible, and the maximal 
number of categories a node is in is minimal. We give an improved upper 
bound construction and show lower bounds for stars, trees, and general 
graphs. 



1 Introduction 

In the 1960's, Milgram (l}|3] observed 
the small world phenomenon, i.e. that 
short paths seem to connect us all in 
the social graph in which the vertices 
represent persons and two persons are 
connected by an edge if and only if they 
know each other. He performed experi- 
ments in which he asked randomly se- 
lected participants to relay letters across 
the USA by passing them to one of their 
direct acquaintances. 

The participants only had simple 
categorical information about the tar- 
get, such as name, location, and occu- 
pation, and knowledge about their own 
acquaintances, that is, they knew the 
local topology of the network. The mes- 
sages arrived typically after only six 
hops. 



Perhaps even more surprising than 
the mere existence of short paths in so- 
cial networks is the apparent ease with 
which humans can discover them, de- 
spite having only little information. Ex- 
periments by sociologists f4'-'7| indicate 
that we use a simple greedy heuristic 
to route. The message is passed on to 
the acquaintance that is most similar 
to the target. 

In [s], Eppstein et al. model this set- 
ting as a graph G — (V, E), where peo- 
ple act as vertices and whose edges rep- 
resent pairwise acquaintance, together 
with a system of categories S C V{V). 
Each category C G 5 is the vertex set of 
a connected subgraph of G. For two ver- 
tices s and t, let the distance from stot 
be the number of categories containing 
t but not s, i. e., 

d{s,t) = |{C e 5 I i e C and s ^ G}\ . 



A system of categories S supports greedy 
routing in G (is good for G) if for any 
two vertices s and t, there is a neigh- 
bor M of s with d{u,t) < d{s,t). For a 
system of categories S, Eppstein et al. 
define its membership dimension as the 
maximal number of categories to which 
any vertex belongs: 



memd(iS) 



max|{Ce 5 \ v e C}\ 



The goal is then to construct a system 
of categories supporting greedy routing 
and having small membership dimen- 
sion. Membership dimension captures 
the cognitive load of the participants, 
i. e. the number of categories an actor 
must keep track of in order to decide 
on the next node of the route. 

Eppstein et al. fs] show the existence 
of a good system of categories S with 

memd(5) e O ((diam(G) + log \V\f) 

and note a lower bound of diam(G); 
here diam(G) denotes the diameter of 
G. 

We substantially improve on the up- 
per bound stated above and establish 
new lower bounds. We review related 
work in Sect. [2| introduce notation in 
Sect.|3j and prove exact bounds for lines, 
grids, and tori in Sect. [4j In Sect. [5] we 
construct for every graph G a good sys- 
tem of categories S with 



2\V\ 



memd(5) = O [d ■ log 



where d — diam(G). This bound im- 
proves upon the bound of Eppstein et 
al. except for diam(G) = ©(log In 
Sect. |6] we show an almost matching 
lower bound. We exhibit for all nonnega- 
tive integers n and d a graph with l-\-nd 
vertices and diameter 2d for which every 



good system of categories has member- 
ship dimension 



n 



dH\V\ld) \ 
\n{dH\V\/d))) 



In Sect. [7] we show that every good sys- 
tem S for a graph G with average degree 
5 has membership dimension 



n diam(G) 



log 1^1 
log (5 



in particular, bounded degree graphs 
require logarithmic membership dimen- 
sion. The bound is best possible. For 
each triple (n, 5, diam) of positive reals 
with 1 < (5, diam < n we exhibit a graph 
with 0{n) vertices, average degree 0{5), 
and diameter ©(diam) for which a good 
system of membership dimension 



O diam 



log 1^1 
log (5 



exists. 



2 Related Work 

We essentially paraphrase the analogous 
section in js]. 

Greedy Routing is a well-studied 
technique with many applications. A 
variety of methods are known, for ex- 
ample geographical information as an 
aid for routing has been explored (9|[l0] . 
This method does not succeed on all 
networks, so a number of enhancements 
have been developed [TT 13 . Several 
groups also examine succinct greedy- 
routing strategies that limit the addi- 
tional information at every vertex to 
be logarithmic in the size of the net- 
work [14 17 . Category-based routing 
with logarithmic size membership di- 
mension is a special case of succinct 
greedy routing. 



A category-based approach has been 



studied by Mei et al. 18 as a heuristic 



for routing in dynamic networks. Nodes 
are assumed to be organized into catego- 
ries by the user. Experimental evidence 
indicates that category-based routing 
outperforms heuristics based on loca- 
tion or random choices. Eppstein et 
al. f8| were the first who studied it from 
a complexity-theoretic viewpoint. We 
reviewed their results in the introduc- 
tion. 

A different approach to routing in 
social networks was studied by Klein- 
berg |19[ . He focuses on location in- 
stead of categorical information to ex- 
plain how we find short routes efficiently. 
Based on his insights he constructs a 
random graph model that has similar 
properties and shows for which parame- 
ters routing is possible. In contrast the 
approach of Eppstein et al. seeks to 
construct a system of categories that 
enables greedy routing for a given net- 
work. 

3 Preliminaries 

Unless stated otherwise, we consider 
undirected graphs G = {V, E) with n = 
\V\ nodes. For two nodes u &V , v 
let dist(u, v) be the number of edges on 
a shortest path between u and v. Then 
define the diameter of a graph G as 



diam(G) 



max distfu.w) 



that is. the diameter is the length of a 
longest shortest path in G. 

Let 5 C 7^(1/) be a system of sub- 
sets of the vertices of G that induce 
connected subgraphs. For a node u € U 
define cat(M) to be the set of categories 
to which u belongs 

cat(u) :^{C eS\ueC}. 



The membership dimension memd(iS) 
is the maximum number of categories 
to which any node belongs, that is, 

mcmd(5) ;= max |cat(u)| . 

We say a system of categories sup- 
ports greedy routing in G or is good 
for G if a greedy routing algorithm suc- 
ceeds. If the message addressed to node 
t Cz V is currently in node u € V and 
u ^ t, the algorithm forwards it to a 
neighbor v G N{u) of u that is closer to 
t according to the distance function 

d{v,t) :— |cat(i)\ cat(w)| , 

The algorithm succeeds if for all u G y 
there is a neighbor v G N{u) such that 
d{v,t) < d{u,t). 

4 Simple Bounds 

Already in Eppstein et al. |8 , it is ob- 
served that the diameter bounds the 
membership dimension from below. 

Lemma 1. For any graph G and good 
system of categories S we have 

memd(5) > diam(G). 

Proof. See Appendix, Proof [T] 

For paths this bound is tight, cf. the 
construction in Fig. [T] We can extend 
this observation to all graphs that can 
be obtained from paths and cycles by 
taking cross products. 

Definition 1. LetG = {V,E), and H ^ 
{V ,E'), he two graphs. Then the cross 
product G X H is the graph (V, E) with 

V xV, 

E = {{{u,x),{v,y)} I 

{{u,v} e E /\x ^ y)V 
i{x,y} e E' Au = v)}). 



( • c c 



Figure 1. Minimal categories for routing 
from tlie leftmost vertex to the rightmost. 
Adding the symmetric categories allows to 
route between any pair of vertices. Every 
vertex is contained in exactly four catego- 
ries. 



Lemma 2. Let M{G) he the minimal 
membership dimension needed to route 
in G. Then 

M{G X H) < M{G) + M{H). 

Proof. See Appendix, Proof |2] 

Lemmas ^ and [2] immediately give 
the following tight bounds. 

Corollary 1. For grid graphs G we get 
M(G) = diam(G). □ 

Corollary 2. For hypercubes G we get 
M[G) = diam(G) = [logn] . □ 

Corollary 3. For tori G — CkX Ci we 
have M{G) = \k/2\ + \l/2'\. □ 

5 Improved Upper Bound 

We construct for every graph G a good 
system of categories S with membership 
dimension 

'^(^^^-(^)^°^di^)- 

It suffices to prove the bound for trees. 
For general graphs, we construct a span- 
ning tree of diameter diam(G) and route 
in the spanning tree. 

Lemma 3. For any tree on n nodes 
with diameter d there is a system of 
categories S of membership dimension 

memd(5) = O{dlog{2n/d)). 



We conjecture this bound to be tight. 
An example might be the star with de- 
gree n/d sending out paths of length d 
as shown in Fig. |6] 

Proof For a tree T let r G V(T) and 
consider a triple {r,L,R) with L,R & 
partitioning of the neighbors of r in 
T. After deleting r, the tree falls into 
components; take the ones containing 
nodes from L. To these components add 
r again (making it a neighbor to all 
nodes in L) to get a tree = TL(r). 
Build Tr symmetrically. This cuts T 
into two trees; each vertex is in exactly 
one of Tl and T^, except for r, which 
belongs to both trees. 

Now, we call (r, L, K) a balanced 
routing cut if both \V{Tl) \ and \V{Tb)\ 
are at least ci • and at most 

C2 • |y(r)|. A basic graph theoretic argu- 
ment shows that any tree has a balanced 
routing cut for constants ci = 1/3 and 
C2 — 2/3, see for example 



20 



Given a tree T we now construct 
categories as follows. Take a balanced 
routing cut (r, L, R) of T. Similar to 
Eppstein et al. [s], we construct catego- 
ries that allow routing from any vertex 
in Tfi all the way to r when having a 
target in (and symmetrically), see 
Fig. [2] Then we recurse on Tr and T^. 
After that, we unify some categories to 
decrease the membership dimension|^ 

The base case of this procedure is 
a graph of constant size, where we add 
any valid system of categories of con- 
stant size. Observe that this way we 
construct a valid system of categories 
for T. 



^ The recursion means that we again find 
a balanced routing cut in Tr, so the 
vertex we split at in Tn does not have to 
be r or in R. This is a major difference 
to Eppstein et al. 





(a) The categories 
before recursing. 




(b) The categories 
for two nodes. 




Figure 2. Categories that allow routing (c) Categories after (d) The merged ca- 
from Tr to r for a target in Tl- recursing. tegories. 



We now describe this construction 
in more detail. Let = dist(i;,r) be 
the distance from v e V(T) to r in Tj^ 
For routing from to Tj^ we add the 
categories V{Tl) U {v (E V{Tr) \ < 
fc}, for < fc < diam(T7?)0We add 
symmetric categories for routing from 
Tl to Tr. Note that these categories 
allow us to route from any vertex in Tr 
to r when having a target in Tl . 

Then we recurse on Tr and T^. 

After that, we change the system of 
categories slightly to decrease its mem- 
bership dimension. We have split at 
(r, T, R) and in the recursion on Tr we 
constructed, say, categories Ri,. . . ,Ra 
containing r and in the recursive call to 
Tl we constructed categories Li, . . . , Lh 
containing r. Assume wlog. a < b. Then 
we can replace the i?,; and Lj by the ca- 
tegories 

{i?, U T, I 1 < i < a} U {La+i, Lb}. 

These categories are still connected and 
greedy routing is still possible, as the Ri 
{Lj) were only needed to route inside 
Tr (Tl). See Fig. [| 

^ or in Tn or Tl, there is no difference if 

u is in that tree. 
^ This is the same construction as in Epp- 

stein et al. [8| 



Figures. Merging of categories after re- 
cursing. 



This ends the construction of catego- 
ries. Observe that we construct a valid 
system of categories. 

It remains to bound the membership 
dimension of the constructed system of 
categories. Consider any node v e V{T). 
Joining categories as described in the 
third part of our construction implies 
that the number of categories contain- 
ing V equals the maximal number of ca- 
tegories containing v constructed on any 
path in the recursion tree. As we select 
balanced routing cuts, in each recursive 
call of the construction the number of 
nodes of the subproblems is decreased 
by a constant factor and hence each 
such path has length 0{\og{n)). 

Consider one such path. On level 
i, < i < C'(log(n)), we considered a 
subproblem T^ on at most n ■ nodes. 
The diameter of Ti is bounded from 
above \yy d — diam(T), as Ti is a subtree 
of T. Moreover, the diameter of Ti is 
bounded by its number of nodes. Hence, 

diam(Ti) < minjd, n ■ C2}. 

Our procedure cuts Ti and adds some 
categories to allow routing over that 



cut. Observe that the number of such 
categories is bounded by O {dia.m{Ti)) . 
Thus, the number of categories contain- 
ing V can be bounded from above by 
(omitting tlie OQ) 

0(log(n)) 

^ mm{d,n- c^}. 

i=Q 

We can bound this sum as 

-logc2("/<i) 0(log(n)) 

i=0 i=-log^^{n/d) + l 

which simpHfies to 

dlog{n/d) +d^ 0{d\og{2n/d)). 

a 

6 Stars 

In this section we prove upper and lower 
bounds for stars. The lower bound for 
stars nearly matches the upper bound of 
the preceding section. A star of diame- 
ter 2d and £ leaves has l+£d nodes. The 
center node has degree £ and each leaf 
is joined to the center node by a path of 
length d. Fig. |4] shows a star with four 
leaves and diameter 2. We start with 
simple upper and lower bounds. 

Lemma 4. In a star with £ leaves, the 
center node is contained in at least log £ 
categories. 

Proof. For every ordered pair {i,j) with 
i ^ j, there must be a category C con- 
taining j and the center, but not i. Oth- 
erwise, every category containing the 
center and j also contains i and hence 
the set of categories containing {j, c} is 
the same as the set of categories con- 
taining {i,j,c}. Thus the distance from 



i to j does not decrease along the edge 
ic, a contradiction. We conclude that 
the number of categories containing the 
center is at least \og£. 

An alternative proof can be found 
in the Appendix, Proof [3] □ 

The same argument establishes: 

Lemma 5. In a tree, each node v is 
contained in least logdegw categories. 

a 

Lemma 6. For a star with £ leaves and 
diameter 2, there is a system of catego- 
ries S with membership dimension 

memd(5) < 1 + 2 [log^] . 

Proof. Let the leaves be numbered from 
1 to £. Every leaf forms a category of 
its own. For every i, < i < [log^] 
we have two categories Zi and Of. Zi 
contains the center and all leaves that 
have a zero in the i-th bit of their binary 
representation; Oi contains the center 
and all leaves that have a one in the i-th 
bit of their binary representation. See 
Fig. |4j Clearly every node is contained 
in at most 1-1-2 [log^] categories. 

Consider any two leaves u and v 
and assume we want to route from u 
to V. Let k be the number of positions 
in which the binary representation of u 
and V differ. Then d{u, v) = 1 + k > 2 
and d{c,v) — 1. Thus we can success- 
fully route from u to v. □ 

We next improve on both bounds. 

Lemma 7. Let k be minimal such that 




Then k = log -}-o(l)) loglog n. 

There is a system of categories S 
that supports routing in stars with n 
leaves and diameter 2 having 




Figure 4. Categories Zi and Oi for a star 
witii £ = 4. 



nienid(5) = fc. 

T/iis bound is tight, i.e., every good sys- 
tem of categories S for this graph has 

nienid(5) > k. 

Proof. See Appendix, Proof |4] 

Similar techniques apply for stars 
with larger diameters. 

Lemma 8. For any star with £ leaves, 
in which every leaf is connected by a 
path of length d to the center, there is a 
system of categories S with membership 
dimension 

mcmd(5) G O(dlog^). 

Proof. The Lemma follows from an ap- 
plication of the algorithm from LemmajSj 
For an alternative proof see Appendix, 
Proof m □ 

We proceed to show a lower bound. 
A key ingredient of our proof is the 
following problem of covering the set 

Q = {{^J) \l<^,j<e, i^j} 



by t rectangles, i.e., 

i<fe<t 

Clearly 5^ n Tfc = in any covering of 
Q by rectangles. We will first prove a 
lower bound for the rectangle covering 
problem and then obtain a lower bound 
for membership dimension by a reduc- 
tion to the rectangle covering problem. 
The intuition for the reduction is that 
every category Ck corresponds to a set 
Sk X Tfc such that (i, j) & SkX-Tk if Ck 
allows us to route between i and the 
center when having j as target. Hence 
by bounding the number of rectangles 
needed to cover Q, we bound the num- 
ber of categories that need to contain 
the center in a star with n leaves. 

Lemma 9. If A\nt < we have 

^"^^■I^32hr7 

fe 

in any covering of Q by t rectangles. 

Proof. See Appendix, Proof [6] 

We now consider a star with I leaves 
in which the arms have length d. Let 

7' = {P(1),...,P(*)} 

be the family of t categories containing 
the center. We will lower bound t using 
the results from Lemma [9l 

Every category containing the cen- 
ter corresponds to a partition 

P=[P^,P^,...,Pa) 

of [0, . . . , where Pk contains the leaves 
for which a path of length k from the 
center to the leaf is contained in the 
category. 



Lemma 10. For every i and j with 
1 5: * 7^ j 5: ^ O'l^d, every k, < k < d, 
there must be a P Cz V such that 

{i e Pk) ^ (j e Pfe). 

Proof. There must be a P such that 
i G Pk and j € Pd as otherwise we 
cannot route from i to j. Since P is a 
partition, j ^ P^. □ 

Lemma 11. Let t he the number of ca- 
tegories containing the center in a star 
with I leaves in which every leaf is con- 
nected to the center by a path of length 
d. Then 

dint 

tint > . 

- 32 

Proof. This proof uses a double count- 
ing technique. For each 1 < i < £ and 
k, define the vector v^ j by 

- e (P«)fe). 

Then by Lemma |10| for every k, the 
vectors in {vi^k | 1 ^ * ^ ^} are pairwise 
distinct and hence by Lemma|9]for every 
fc, < fc < d, 



i<i<t 



> 



line 

321ni' 



Summation over k yields 
d-i t 



32 \nt 



t d-l 
3 = 1 fc=0 

< t£. 



□ 



Theorem 1. Consider a star in which 
each of the t leaves is connected to the 
center by a path of length d and let t be 
the number of categories that contain 
the center. If i > S, 

d\n£ 

t > 



Proof. Assume otherwise, and let X = 
dlni. Since tint is an increasing func- 
tion in t, Lemma |l 1 1 implies 



X 



321nX 
and hence 

In 



In 



X 



32\nX 



X 

> — 
- 32 



X 



32lnX 
a contradiction. 



> InX, 



□ 



7 A Universal Lower 
Bound 

The lower bounds of the preceding sec- 
tion are existential. We showed the ex- 
istence of graphs for which every good 
system of categories has a certain mem- 
bership dimension. The lower bound al- 
most matches the universal upper bound 
of Sect. [5] In this section, we show a 
universal lower bound: every good sys- 
tem of categories for a graph G has 
membership dimension 



log 1^1 
log (5 



f2 diam(G) 



32(lnd-|-lnln^)' 



where 6 is the average degree. We also 
show that this bound is best possible. 

Theorem 2. In a graph G = {V,E) 
with average degree d/2, there is a node 
that is contained in 

!7(diam(I?) -|- log n/ log 5) 

categories. 

Proof. The lower bound of diam(D) was 
already established by Eppstein et al. 
We turn to the second bound. Since 
G has average degree 5/2, there are at 
least n/2 nodes that have degree less 
than S. Among these we can greedily 



find an independent set I of size at least 
f2{n/5). Consider the graph 



G'= {V{G),E{G)\J 



i. c, G where the subgraph outside of / 
is augmented to a clique. Routing in G' 
can only be easier than in G. We show 
a lower bound for G' . 

We want to show that either a node 
in 7 or a node in its neighborhood 

N{i) ^{u\v el ^{u,v} e E] 

is in i7(logn/log(5) categories. Fix a 
system of categories (Cj)i<j<m that al- 
lows greedy routing. For every node v 
in / U N{T) define a pattern as 



1 V e Ci 

V ^CiAN{v)^GiT^^ 
* V ^CiA N{v) n Ci = 



We say two patterns p^^\ p'"-* match if 
they agree on all positions where both 
have a 1 or (i.e., * matches to any- 
thing). Routing is only possible if no 
two patterns match. Let u and v be dis- 
tinct vertices. Greedy routing from u 
to V requires the existence of a neigh- 
bor z of u and a category Q with Ci G 
{ca.t{v) n cat(z)) \cat(u). Then p^^"^ = 1 
and p^"^ = 0. 

To each pattern we assign a region of 
points in the hypercube {0, 1}™, namely 
the set of all matching bitstrings. Ob- 
serve that two patterns match iff their 
regions in the hypercube overlap. In- 
tuitively, this means that for allowing 
greedy routing we need to set many val- 
ues in the p^") to or 1 to make these 
regions small enough to accommodate 
all without overlap. A high number of 
I's forces nodes from I into many cate- 
gories, a high number of O's forces nodes 
from N{I) into many categories. 



To make this into a formal argu- 
ment, consider the cost of a vertex v G I 
defined as 

c{v) :=t^;'^ + -/^\ 

(v) 

where tj. is the number of positions 
in the vector p^"^ equal to k. Note that 
c{v)/2 is a lower bound for 

max I cat (u) I , 

ue{v}UN(v) 

the maximal number of categories u or 
one of its neighbors is in, since each 
1 means an additional category for v 
and each an additional category for 
one of its at most S neighbors. In the 
remainder of the proof we show that 
there is a vertex v G I with c{v) = 
J? (log n/log 6). 

For (5 = 1, an easy argument shows 
an ^?(logn) lower bound. Define a mea- 
sure on the hypercube {0, 1}™ as 

^^(x) = 2—, 

for X e {0, 1}", and by 

for X C {0, 1}". Then the total mea- 
sure of the hypercube is 1. As the re- 
gions defined by the patterns p*-"-* with 
V G I must be disjoint, there must be 
a pattern that has measure fi{p^'"^) no 
more than 1/n. As A*(p^''-*) = 2~'^^'"'> we 
get c{v) > log n. 

The same argument can be applied 
if S is bounded by some constant, as 
then replacing (5 by 1 in the definition of 
c{v) does not change it asymptotically. 

For (5 greater than some large enough 
constant we change the measure on the 
hypercube to 

A(a;) = a^^- (l-ar-^^, 



for an < a < 1 that is to be deter- 
mined, where ^ x denotes the number 
of I's in the bitstring x E {0, 1}™ (and, 
thus, m— ^ X the number of O's). Again 
we sum up for subsets of the hypercube 
and the whole hypercube has measure 
1. The parameter a will incorporate the 
reduced weight of O's in the patterns. 
We want to choose a such that for some 
C > 

a = 2-^ 
(1-a) =2-^/^ 

or, equivalently, we want to find a C 
such that 

1 = 2-<^ + 2-^/^ (1) 

as then again fi{p^'''^) = 2^^-<''\ Now, 
we argue as in the S = I case. Since the 
patterns may not overlap, there has to 
be a pattern with measure at 
most 1/n. As n{p^'''>) = 2-^-'=(^) we get 
c{v) > logn/C, which shows the claim 
assuming C = 0{\ogd). It remains to 
show the latter. 

Unfortunately, we cannot solve Q 
exactly for C. However, for C = 0{S) 
we have 2"^/'' = 1-0(0/6), and, thus, 
for C = log 6 

2-^ + 2-^/^ = l^^^<l, 

for S large enough. On the other hand, 
for C = log (5 — 2 log log S we have 

o-c , o-c/s _ log^S-ejlogS) 

Z +Z - ^ >i, 

for 6 large enough. Hence, there is a 
root C — log (5 — ©(loglogcJ). □ 

This bound is tight. For any triple 
(n, 6, diam) of positive reals with 1 < 
5, diam < n, we can construct a tight 



Figures. Tight example for Theorem [2] 



example with 0(n) vertices, average de- 
gree 0(d), and diameter ©(diam). For 
this construction take a J-Clique Kg 
and connect a set O of n additional 
nodes to all nodes in Kg. Add a path 
P of length diam. For an example see 
Fig. [5) The graph G thus constructed 
has the required parameters. 

We construct a good system of ca- 
tegories for G with a membership di- 
mension matching the lower bound of 
6)(diam(G)-Hlog(n/(5)) from Theorem|2j 
Routing from the nodes on P to all 
other nodes is possible with diam(G) 
membership dimension as in the solu- 
tion for simple paths (see Fig. [ij. In 
the construction we treat the nodes of 
Ks as one, except for the node that is 
directly connected to P and similarly 
the nodes in O. 

To enable routing between the n 
nodes in O, we generalize the construc- 
tion from Lemma [6l We number the 
vertices in O with base 6 (more pre- 
cisely, max{2, [(5]}). Then any vertex 
V € O corresponds to a unique string 
(4"^ , . . . , of length k = log n/ log S 
with each bi S {0, . . . ,S — 1}. For each 
< i < k and < j < S we create a 
category 

G.,r-^{v(^0\b'^^ ^j}U{u,}, 



where ui, . . . ,us are the vertices of the 
cHque Ks ■ Additionally we add a single- 
ton category for each vertex. 

Now, every vertex in the clique and 
O is in at most fc + 1 such categories. 
Moreover, we can route between any 
pair u,v € O, as there is an i with 
^l"-* 7^ b'f\ so category ^(y) allows to 
route from u to the clique when having 
V as target. 

8 Conclusion and Open 
Problems 

In this paper we presented a improved 
construction of systems of categories S 
that support greedy routing in graphs 
G. The previous best method uses a 
membership dimension of C'((diam(G') + 
log |y 1)^), whereas our results show that 

C'(diam(G) log(2 \V\ / diam(G)) 

is sufficient. Besides improved upper 
bounds we also show much stronger 
lower bounds than previously known. 
Our results improve the lower bound 
from diam(G) to 

f?(diam(G) +log|y|/log^) 

for graphs of average degree 6. This 
lower bound is tight for certain graphs. 
For the restricted class of stars of diam- 
eter d and n leaves we showed a lower 
bound of 

n{{dlogn)/{\ogd + loglogn)). 

For the case of diameter two the stronger 
bound 

i7(log n + loglog n) 

holds. 

Many open problems remain. Fore- 
most we conjecture that O{d\og{2n/d)) 



is tight for stars of diameter d, but our 
lower bound is weaker. Also, the best 
upper bounds are achieved by taking 
a spanning tree and constructing cate- 
gories for it. Is it possible to exploit 
the properties of the graph better than 
this? 

Empirical studies show that social 
networks are graphs with a power-law 
degree distribution and a large clus- 
tering coefficient (see e. g. (2l] for an 
overview). The example graphs consid- 
ered in this paper do not have these 
properties. It would therefore be inter- 
esting to study their consequences in 
the minimal membership dimension re- 
quired for routing. Intuitively the mem- 
bership dimension should also depend 
on the expansion properties of the graph. 

From a sociological point of view it 
is also interesting to see how natural 
relaxations of the greedy rule, e. g. al- 
lowing nodes to choose a neighbor at 
random in case of distance ties, influ- 
ence the bounds. 
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Appendix 



Stars 



Here we show the proofs that had to be 
omitted from the extended abstract due 
to length hmitations. For convenience 
we repeat the corresponding Lemmas. 

Simple Bounds 

Lemma. For any graph G and good 
system of categories S we have 

memd(5) > diam(G'). 

Proof 1. Let s e V{G), t e V(G) be a 
pair of vertices such that dist(s,t) = 
diam(G). Consider the path P that a 
message from s to i takes according to 
the greedy routing system. Note that 
\P\ > diam(G'). For every edge (u, v) G 
P we have d{v,t) < d{u,t) — 1, and 
hence 

d(s,t) > \P\ > diam(G). 

By the definition of d(-), this is only 
possible if |cat(t)| > diam(G). 

Lemma. Take M{G) as the minimal 
membership dimension needed to route 
in G. Then 

M(G X H)< M{G) + M{H). 

Proof 2. Let Gg be a minimal category 
system for G and let Gh be a mini- 
mal category system for H . We con- 
struct a system of membership dimen- 
sion M{G)+M{H) for GxH. For every 
c G Gg, we add c x V to the system 
and for every c' G Gh, we add V y. c' 
to the system. 

We route from [u, x) to (v, y) by 
first using the first kind of categories 
to route to {v,x) and then using the 
second kind of categories to route to 

{v,y)- ^ 



Lemma. In a star with £ leaves, the 
center node is contained in at least log £ 
categories. 

Proof 3. Let G be any category con- 
taining the center and at least one leaf, 
but not all leaves. Let k be the num- 
ber of leaves contained in C If fc > £/2 
consider the subgraph containing the 
center and the leaves in G. G cannot 
be used to route between these leaves. 
If fc < £/2 consider the subgraph con- 
taining the center and the leaves not in 
G . Since G contains none of the leaves 
in this subgraph it cannot be used to 
route between any pair of these leaves. 

Let M{£) be the minimal number 
of categories containing the center in a 
star with £ leaves. The paragraph above 
establishes M{£) > 1 + M{£/2). Thus 
M{£)>log£. 

Lemma. Let k be minimal such that 




Then k = log n+ +o(l)) loglog n. 

There is a system of categories S 
that supports routing in stars with n 
leaves and diameter 2 having 

memd(5) = k. 

This bound is tight, i.e., every good sys- 
tem of categories S for this graph has 

memd(5) > fc. 

Proof 4- Let c be the center of the star. 
For any two distinct leaves i and j there 
must be categories C and C" such that 
G, G' € cat(j) \ cat{i), G e cat{c), and 
C" ^ cat{c), as otherwise one cannot 
route from i to j. Let Ci, . . . , Cfc be the 



categories containing the center. For ev- 
ery i define a binary vector Vi of length 
k by 

iv,)k = {ie Ck) for 1 < fc < t. 

The vectors Vi, 1 < i < (- form an anti- 
chain in the set of all binary vectors 
of length k as for every i and j with 
i ^ j , there must he & k with {vi)k = 
and {vj)k = 1. As the maximal size of 
an anti-chain in the set of all binary 
vectors of length k is 



Lfc/2j;' 

the lower bound follows. 

For the upper bound there are n 
distinct bitstrings of length k each con- 
taining exactly [fc/2j ones. Arbitrarily 
assign the strings to the leaves. We have 
k + n categories. The latter n categories 
are singleton sets; they contain one leaf 
each. The former k categories contain 
the center and the leaves for which the 
corresponding bit is one. For any two 
distinct leaves i and j there is a cate- 
gory C containing j and the center, but 
not i, because the bitstrings form an 
anti-chain. We use C to route from i to 
the center and the singleton category 
for j to continue to j. Observe that the 
center is in k categories and each leaf 
is in [A;/2J + 1 < fc categories. 

The equation for k can be derived 
using Stirling's formula. 

Lemma. For any star with I leaves, 
in which every leaf is connected by a 
path of length d to the center, there is a 
system of categories S with membership 
dimension 

memd(5) G O(<ilog^). 

Proof 5. We perform a straightforward 
extension of the solution for stars of 




Figure 6. The d copies of a category. 



diameter 2. Again every leaf u has a 
binary number b{u), encoded as catego- 
ries Oi and Zi, 1 < i < \\ogi~\ . Create d 
copies of these numbers using different 
categories. Category 0'i'\ respectively 

(k) 

Zl , contains all nodes on the path 
from leaves {u \ bi{u) = 1}, respectively 
{u I bi{u) — 0}, to the center, as well 
as all nodes on the remaining paths up 
to a (graph-)distance k from the center, 
see Fig. |6] 

Additional d categories are needed 
for every leaf to route from the center 
down to the leaves, analogous to the 
singleton categories for leaves in the 
small star. 



Lemma. // 4 In i < In £, we have 

k 



' ~ 321ni 
in any covering of Q by t rectangles. 



Proof 6. The proof uses a double count- 
ing technique. For every i, 1 < i < i, 
define a binary vector Vi £ {0, 1}* that 



indicates which 5*^ contain i as 

K)fe = (i e Sk), l<k<t. 

We show that the Vi are pairwise dis- 
tinct and hence there must be at least 
£ such vectors. Consider any i and j 
with i ^ j. Since (i, j) in Q, there must 
be a fc with i € Sk and j € T^. Since 
Sk r\Tk = ^, we conclude j ^ Sk and 
hence {vi)k ^ {vj)k- 

Clearly the total number of ones in 
all Vi equals \Sk\- This number is 
minimized if there is an such that 
all vectors with less than ho ones are 
used, and the remaining vectors contain 
exactly ho ones. Then ho must be such 
that 



< 



h=0 ^ ' h=0 



t- 1 

Thus as 4 In t < In £ by assumption, 



ln£ ln£ 
/lo > 1 > . 



first term. There are two subcases. If 
ho < t/3 and hence for all 1 < h < Hq 



t\ ,/ t \ t + 1 



we have 



h=0 ^ ^ ^ ^ j>0 



hn-ir 



and hence 

ho-l 



h=0 



> 



161nt 

Finally, if Hq > t/3, we bound 

t/4-l 



h=0 



t \ I f t 



< 



2VI + 1 



1 'r^' ft 



2 \h 

h=t/A 



We split the sum into the vectors that 
contain less than ones and the rest. 
Then the number of ones is at least 



ho-l 



h=0 



ho-l 

E 

h=0 



We now distinguish cases. If 

then N > {n/2)ho. Otherwise there are 
more than £/2 vectors with less than 
ho ones and we lower bound A'' by the 



and hence 



ho-l / X ho-l . . t/i-1 

E ^ E - E 

h=t/4 ^ ' h=0 ^ ' h=0 ^ 

ho-l 



- 2 ^ \h 

h=0 

> 



Therefore 



£ t £ho nn£ 
N > - ■ - > — > 



4 4 - 16 - 321nt 



